AITopics | log-likelihood function

General bounds on the quality of Bayesian coresets

Neural Information Processing Systems, Mar-22-2026, 18:34:36 GMT

Bayesian coresets speed up posterior inference in the large-scale data regime by approximating the full-data log-likelihood function with a surrogate log-likelihood based on a small, weighted subset of the data. But while Bayesian coresets and coreset construction methods are applicable in a wide range of models, existing theoretical analyses of the posterior inferential error incurred by coreset approximations apply only in restrictive settings, namely exponential family models or models with strong log-concavity and smoothness assumptions. This work presents general upper and lower bounds on the Kullback-Leibler (KL) divergence of coreset approximations that reflect the full range of applicability of Bayesian coresets. The lower bounds require only mild model assumptions typical of Bayesian asymptotic analyses, while the upper bounds require the log-likelihood functions to satisfy a generalized subexponentiality criterion that is weaker than conditions used in earlier work. The lower bounds are applied to obtain fundamental limitations on the quality of coreset approximations, and to provide a theoretical explanation for the previously observed poor empirical performance of importance sampling-based construction methods. The upper bounds are used to analyze the performance of recent subsample-optimize methods. The flexibility of the theory is demonstrated in validation experiments involving multimodal, unidentifiable, heavy-tailed Bayesian posterior distributions.
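
The surrogate described in the abstract above, a weighted sum of log-likelihood terms over a small subset, can be sketched as follows. This is only an illustration under stated assumptions: the unit-variance Gaussian location model, the function names, and the uniform-subsample weighting N/M are all hypothetical choices made here, not the constructions analyzed in the paper.

```python
import numpy as np

def log_lik_terms(theta, x):
    """Per-datapoint log-likelihoods for an assumed N(theta, 1) model."""
    return -0.5 * (x - theta) ** 2 - 0.5 * np.log(2.0 * np.pi)

def full_log_lik(theta, x):
    """Full-data log-likelihood: sum over all N datapoints."""
    return float(np.sum(log_lik_terms(theta, x)))

def coreset_log_lik(theta, x, idx, weights):
    """Surrogate log-likelihood: weighted sum over the coreset indices only."""
    return float(np.sum(weights * log_lik_terms(theta, x[idx])))

def uniform_coreset(n, m, rng):
    """Baseline coreset (assumption): m points drawn uniformly, each weighted n/m."""
    idx = rng.choice(n, size=m, replace=False)
    weights = np.full(m, n / m)
    return idx, weights

rng = np.random.default_rng(0)
x = rng.normal(loc=1.0, scale=1.0, size=10_000)
idx, w = uniform_coreset(len(x), 100, rng)

# Compare the surrogate with the full-data value at one parameter setting.
print(full_log_lik(0.3, x), coreset_log_lik(0.3, x, idx, w))
```

Evaluating the surrogate costs O(M) per parameter value instead of O(N). The quality of the resulting posterior approximation depends on how the indices and weights are chosen, which is the question the bounds in the abstract address.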

artificial intelligence, machine learning, neural information processing system 37, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.43)

Global Convergence of Least Squares EM for Demixing Two Log-Concave Densities

Wei Qian, Yuqian Zhang, Yudong Chen

Neural Information Processing Systems, Feb-13-2026, 11:37:20 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, log-concave distribution, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > Canada (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.96)

Fully Neural Network based Model for General Temporal Point Processes

Takahiro Omi, Naonori Ueda, Kazuyuki Aihara

Neural Information Processing Systems, Feb-11-2026, 22:32:36 GMT

A temporal point process is a mathematical model for a time series of discrete events, which covers various applications. Recently, recurrent neural network (RNN) based models have been developed for point processes and have been found effective.

artificial intelligence, intensity function, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

4f00921114932db3f8662a41b44ee68f-Paper.pdf

Neural Information Processing Systems, Feb-8-2026, 09:56:49 GMT

However, it remains a challenge to extract reliable inference from complex datasets with uncertainty quantification.

artificial intelligence, machine learning, zt 0, (18 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.94)

1dba5eed8838571e1c80af145184e515-Supplemental.pdf

Neural Information Processing Systems, Feb-7-2026, 18:15:48 GMT

discrimination, historical encoder, maximization step, (10 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.36)

4f00921114932db3f8662a41b44ee68f-Supplemental.pdf

Neural Information Processing Systems, Oct-9-2025, 14:25:04 GMT

artificial intelligence, Hawkes process, machine learning, (15 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Communications (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Uncertainty Quantification for Inferring Hawkes Networks

Haoyun Wang

Neural Information Processing Systems, Oct-9-2025, 14:24:57 GMT

However, it remains a challenge to extract reliable inference from complex datasets with uncertainty quantification.

artificial intelligence, Hawkes process, machine learning, (14 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Therapeutic Area > Neurology (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Communications (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Fully Neural Network based Model for General Temporal Point Processes

Takahiro Omi, Naonori Ueda, Kazuyuki Aihara

Neural Information Processing Systems, Oct-2-2025, 13:32:29 GMT

A temporal point process is a mathematical model for a time series of discrete events, which covers various applications. Recently, recurrent neural network (RNN)-based models have been developed for point processes and have been found effective. RNN-based models usually assume a specific functional form for the time course of the intensity function of a point process (e.g., exponentially decreasing or increasing with the time since the most recent event). However, such an assumption can restrict the expressive power of the model. We herein propose a novel RNN-based model in which the time course of the intensity function is represented in a general manner. In our approach, we first model the integral of the intensity function using a feedforward neural network and then obtain the intensity function as its derivative. This approach enables us both to obtain a flexible model of the intensity function and to evaluate exactly the log-likelihood function, which contains the integral of the intensity function, without any numerical approximations. Our model achieves competitive or superior performance compared to the previous state-of-the-art methods on both synthetic and real datasets.
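
The idea of modeling the integral of the intensity and differentiating it can be illustrated with a small sketch. Everything here is an assumption made for illustration: the tiny one-hidden-layer tanh network, its parameter names (`v`, `w`, `b`, `c`), and the omission of any dependence on event history (the RNN part of the abstract's model is left out). With nonnegative `v`, `w` and positive `c`, the cumulative function is increasing, so its derivative is a valid positive intensity.

```python
import numpy as np

def cumulative_hazard(tau, v, w, b, c):
    """Phi(tau): integral of the intensity from 0 to tau, with Phi(0) = 0.

    Nonnegative v, w and positive c keep Phi increasing in tau.
    """
    tau = np.asarray(tau, dtype=float)[:, None]
    hidden = np.tanh(w * tau + b)
    return (hidden - np.tanh(b)) @ v + c * tau[:, 0]

def hazard(tau, v, w, b, c):
    """Intensity phi(tau) = dPhi/dtau, written out analytically."""
    tau = np.asarray(tau, dtype=float)[:, None]
    hidden = np.tanh(w * tau + b)
    return ((1.0 - hidden ** 2) * w) @ v + c

def log_lik(taus, v, w, b, c):
    """Exact log-likelihood of inter-event intervals: sum of log phi - Phi."""
    return float(np.sum(np.log(hazard(taus, v, w, b, c))
                        - cumulative_hazard(taus, v, w, b, c)))

# Hypothetical parameter values and inter-event intervals.
v = np.array([0.5, 1.0])
w = np.array([1.0, 0.3])
b = np.array([0.0, -1.0])
c = 0.2
taus = np.array([0.4, 1.3, 0.7])
print(log_lik(taus, v, w, b, c))
```

Because the integral term is the network output itself, no quadrature is needed to evaluate the log-likelihood. In practice the derivative would typically come from automatic differentiation rather than a hand-written formula.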

artificial intelligence, intensity function, machine learning, (19 more...)

Neural Information Processing Systems

Country: Asia (0.14)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

Global Convergence of Least Squares EM for Demixing Two Log-Concave Densities

Wei Qian, Yuqian Zhang, Yudong Chen

Neural Information Processing Systems, Aug-19-2025, 22:57:26 GMT

Understanding the convergence property of EM is highly nontrivial due to the non-convexity of the negative log-likelihood function.

algorithm, convergence, log-concave distribution, (14 more...)

Neural Information Processing Systems

Country: North America > Canada (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.96)

Learning Overspecified Gaussian Mixtures Exponentially Fast with the EM Algorithm

Assylbekov, Zhenisbek, Legg, Alan, Pak, Artur

arXiv.org Machine Learning, Jun-16-2025

We investigate the convergence properties of the EM algorithm when applied to overspecified Gaussian mixture models -- that is, when the number of components in the fitted model exceeds that of the true underlying distribution. Focusing on a structured configuration where the component means are positioned at the vertices of a regular simplex and the mixture weights satisfy a non-degeneracy condition, we demonstrate that the population EM algorithm converges exponentially fast in terms of the Kullback-Leibler (KL) distance. Our analysis leverages the strong convexity of the negative log-likelihood function in a neighborhood around the optimum and utilizes the Polyak-Łojasiewicz inequality to establish that an $ε$-accurate approximation is achievable in $O(\log(1/ε))$ iterations. Furthermore, we extend these results to a finite-sample setting by deriving explicit statistical convergence guarantees. Numerical experiments on synthetic datasets corroborate our theoretical findings, highlighting the dramatic acceleration in convergence compared to conventional sublinear rates. This work not only deepens the understanding of EM's behavior in overspecified settings but also offers practical insights into initialization strategies and model design for high-dimensional clustering and density estimation tasks.
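
For readers unfamiliar with how a Polyak-Łojasiewicz (PL) inequality yields an $O(\log(1/ε))$ iteration count, the generic argument can be sketched as follows. This is the textbook version for gradient descent on an $L$-smooth objective $f$ with minimum value $f^*$ and PL constant $μ$; the symbols $L$, $μ$, and $Δ_k$ are introduced here as assumptions, and the constants in the paper's EM analysis may differ.

```latex
% PL inequality (assumed to hold on the region of interest):
\tfrac{1}{2}\,\|\nabla f(x)\|^2 \;\ge\; \mu\,\bigl(f(x) - f^*\bigr).

% One gradient step x_{k+1} = x_k - \tfrac{1}{L}\nabla f(x_k) with L-smoothness:
f(x_{k+1}) \;\le\; f(x_k) - \tfrac{1}{2L}\,\|\nabla f(x_k)\|^2 .

% Combining the two, with \Delta_k := f(x_k) - f^*:
\Delta_{k+1} \;\le\; \Bigl(1 - \tfrac{\mu}{L}\Bigr)\Delta_k
\quad\Longrightarrow\quad
\Delta_k \;\le\; \Bigl(1 - \tfrac{\mu}{L}\Bigr)^{k}\Delta_0 \;\le\; e^{-k\mu/L}\,\Delta_0 .

% Hence \Delta_k \le \varepsilon as soon as
k \;\ge\; \frac{L}{\mu}\,\log\frac{\Delta_0}{\varepsilon}
\;=\; O\bigl(\log(1/\varepsilon)\bigr).
```

The geometric contraction factor $1 - μ/L$ is what distinguishes this linear rate from the sublinear rates mentioned in the abstract.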

algorithm, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

2506.1185

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Asia > Middle East > Jordan (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(4 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
